Mining Co-Location Patterns with Rare Events from Spatial Data Sets
نویسندگان
چکیده
A co-location pattern is a group of spatial features/events that are frequently co-located in the same region. For example, human cases of West Nile Virus often occur in regions with poor mosquito control and the presence of birds. For colocation pattern mining, previous studies often emphasize the equal participation of every spatial feature. As a result, interesting patterns involving events with substantially different frequency cannot be captured. In this paper, we address the problem of mining co-location patterns with rare spatial features. Specifically, we first propose a new measure called the maximal participation ratio (maxPR) and show that a co-location pattern with a relatively high maxPR value corresponds to a colocation pattern containing rare spatial events. Furthermore, we identify a weak monotonicity property of the maxPR measure. This property can help to develop an efficient algorithm to mine patterns with high maxPR values. As demonstrated A preliminary version of the paper appeared as [13]. The research of the second author is supported in part by Natural Sciences and Engineering Research Council of Canada under grant number 312194-05 and National Science Foundation of the United States under grant number IIS-0308001. All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agency. Y. Huang (B) Department of Computer Science and Engineering, University of North Texas, Texas, USA e-mail: [email protected] J. Pei School of Computing Science, Simon Fraser University, Burnaby, Canada e-mail: [email protected] H. Xiong Management Science and Information Systems Department, Rutgers University, Newark, USA e-mail: [email protected] 240 Geoinformatica (2006) 10: 239–260 by our experiments, our approach is effective in identifying co-location patterns with rare events, and is efficient and scalable for large-scale data sets.
منابع مشابه
Mining Of Spatial Co-location Pattern from Spatial Datasets
Spatial data mining, or knowledge discovery in spatial database, refers to the extraction of implicit knowledge, spatial relations, or other patterns not explicitly stored in spatial databases. Spatial data mining is the process of discovering interesting characteristics and patterns that may implicitly exist in spatial database. A huge amount of spatial data and newly emerging concept of Spati...
متن کاملMining Co-location Patterns from Spatial Data Using Rulebased Approach
Co-location pattern is a group of spatial features/events that are frequently co-located in the same region. The co-location pattern discovery process finds the subsets of features frequently located together. Co-location rules are identified by spatial statistics or data mining techniques. A co-location algorithm has been used to discover the co-location patterns which possess an ant monotone ...
متن کاملHierarchical Window Centric Method of Modeling Spatial Co-Location Patterns on Spatial Database
With the growing technology, most of the focus is on spatial data mining. Spatial data mining refers to the extraction of unknown and unexpected information from spatial data sets of massive, high dimensionality and complex spatial databases. Spatial data mining is a process to discover related knowledge, potentially constructive and high utility patterns embedded in geographic Information. App...
متن کاملEvent Centric Modeling Approach in Colocation Pattern Snalysis from Spatial Data
Spatial co-location patterns are the subsets of Boolean spatial features whose instances are often located in close geographic proximity. Co-location rules can be identified by spatial statistics or data mining approaches. In data mining method, Association rule-based approaches can be used which are further divided into transaction-based approaches and distance-based approaches. Transaction-ba...
متن کاملA multiple window-based co-location pattern mining approach for various types of spatial data
Studies on spatial co-location mining required distance threshold to define spatial neighbourhood (Shashi Shekhar and Yan Huang(2001); Yoo and Shekhar (2004, 2006); Yasuhiko Morimoto(2001); Koperski and Han(1995); Ding et al. (2008)) However, it is problematical for users to choose suitable threshold values because they lack prior knowledge about spatial data. Spatial neighbourhood has been def...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- GeoInformatica
دوره 10 شماره
صفحات -
تاریخ انتشار 2006